Bayes in Wonderland! Predictive Supervised Classification Inference Hits Unpredictability

نویسندگان

چکیده

The marginal Bayesian predictive classifiers (mBpc), as opposed to the simultaneous (sBpc), handle each data separately and, hence, tacitly assume independence of observations. Due saturation in learning generative model parameters, adverse effect this false assumption on accuracy mBpc tends wear out face an increasing amount training data, guaranteeing convergence these two under de Finetti type exchangeability. This result, however, is far from trivial for sequences generated Partition Exchangeability (PE), where even umpteen does not rule possibility unobserved outcome (Wonderland!). We provide a computational scheme that allows generation PE. Based that, with controlled increase we show sBpc and mBpc. underlies use simpler yet computationally more efficient instead simultaneous. also parameter estimation giving rise partition exchangeable sequence well testing paradigm equality across different samples. package supervised classifications, hypothesis Ewens sampling formula deposited CRAN PEkit package.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supervised Naive Bayes Parameters

Bayesian network models are widely used for supervised prediction tasks such as classification. The Naive Bayes (NB) classifier in particular has been successfully applied in many fields. Usually its parameters are determined using ‘unsupervised’ methods such as likelihood maximization. This can lead to seriously biased prediction, since the independence assumptions made by the NB model rarely ...

متن کامل

Predictive minimum Bayes risk classification for robust speech recognition

This paper presents a new Bayes classification rule towards minimizing the predictive Bayes risk for robust speech recognition. Conventionally, the plug-in maximum a posteriori (MAP) classification is constructed by adopting nonparametric loss function and deterministic model parameters. Speech recognition performance is limited due to the environmental mismatch and the ill-posed model. Concern...

متن کامل

Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...

متن کامل

Enhancing Supervised Terrain Classification with Predictive Unsupervised Learning

This paper describes a method for classifying the traversability of terrain by combining unsupervised learning of color models that predict scene geometry with supervised learning of the relationship between geometric features and traversability. A neural network is trained offline on hand-labeled geometric features computed from stereo data. An online process learns the association between col...

متن کامل

The Bayes Inference Engine

We are developing a computer application, called the Bayes Inference Engine, to provide the means to make inferences about models of physical reality within a Bayesian framework. The construction of complex nonlinear models is achieved by a fully object-oriented design. The models are represented by a data-flow diagram that may be manipulated by the analyst through a graphicalprogramming enviro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10050828